The BBN TransTalk Speech-to-Speech Translation System

Author: David Stallard
Fred Choi
Jacob Devlin
Kriste Krstovski
Prem Natarajan
Ralf Meermeier
Rohit Prasad
Shankar Ananthakrishnan
Shirin Saleem
Publication venue: IntechOpen
Publication date: 21/06/2011

IntechOpen

Retention in pre-antiretroviral treatment care in a district of Karnataka, India: how well are we doing?

Author: Ananthakrishnan R
Bhargava N
Das M
Devi M
Kumar A M V
Kumar S
Nagaraja S B
Rewari B
Satyanarayana S
Shankar D
Shastri S
Zachariah R
Publication venue: International Union Against Tuberculosis and Lung Disease
Publication date: 21/12/2014

MSF Field Research

AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model

Author: Ananthakrishnan Shankar
FitzGerald Jack
Gupta Rahul
Hamza Wael
Khan Haidar
Natarajan Prem
Peris Charith
Prakash Chandana Satya
Rawls Stephen
Rosenbaum Andy
Rumshisky Anna
Soltan Saleh
Sridhar Mukund
Triefenbach Fabian
Tur Gokhan
Verma Apurv
Publication date: 03/08/2022

In this work, we demonstrate that multilingual large-scale sequence-to-sequence (seq2seq) models, pre-trained on a mixture of denoising and Causal Language Modeling (CLM) tasks, are more efficient few-shot learners than decoder-only models on various tasks. In particular, we train a 20 billion parameter multilingual seq2seq model called Alexa Teacher Model (AlexaTM 20B) and show that it achieves state-of-the-art (SOTA) performance on 1-shot summarization tasks, outperforming a much larger 540B PaLM decoder model. AlexaTM 20B also achieves SOTA in 1-shot machine translation, especially for low-resource languages, across almost all language pairs supported by the model (Arabic, English, French, German, Hindi, Italian, Japanese, Marathi, Portuguese, Spanish, Tamil, and Telugu) on the Flores-101 dataset. We also show that, in the zero-shot setting, AlexaTM 20B outperforms GPT3 (175B) on the SuperGLUE and SQuADv2 datasets and provides SOTA performance on multilingual tasks such as XNLI, XCOPA, Paws-X, and XWinograd. Overall, our results present a compelling case for seq2seq models as a powerful alternative to decoder-only models for Large-scale Language Model (LLM) training.

arXiv.org e-Print Archive

Corrigendum: Exploring the dynamics of arrivals and prices volatility in onion (Allium cepa) using advanced time series techniques

Author: A. Aravinthkumar
Ashu Chandel
Hukam Chand
Neha Mishra
R. Kumaraperumal
Rakesh Kumar
Rakesh Kumar Gupta
S. Ananthakrishnan
S. R. Naffees Gowsar
S. Vishnu Shankar
Subhash Sharma
Publication venue: Frontiers Media S.A.
Publication date: 01/09/2023

Directory of Open Access Journals

The Use of Mobile Health Technology in Promoting Infant Vaccine Adherence – A Health Technology Assessment

Author: A Ananthakrishnan
A Chahar
A Dang
J Sharma
K Kachroo
M Ameel
M Shankar
M Vsn
Publication venue: Elsevier BV

Crossref

On the models for laminar flow tubular reactor

Author: Ananthakrishnan
Bischoff
Bischoff
Bosworth
Cleland
Danckwerts
Edward
Fan
Ferrell
Gupta
Horn
Houghton
Hulburt
Lauwerier
Nigam
Shankar Subramanian
Standart
Taylor
Wan
Wehner
Wissler
Wissler
Woodhead
Publication venue: Wiley

Crossref